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Abstract 

Width parametcrizations of SAT, such as tree-width and path-width, enable the study 
of computationally more tractable and practical SAT instances. We give two simple algo- 
rithms. One that runs simultaneously in time-space (0*(22'^i^('?^)), Q* (2'^^^'>''>)) and an- 
other that runs in time-space (O*(3'7"W(0)iog|0|-)^ where TWicj)) is the tree-width 
of a formula (/) with many clauses and variables. This partially answers the question 
I of Alekhnovitch and Razborov |AR02j . who also gave algorithms exponential both in time 
^ ■ and space, and asked whether the space can be made smaller. We conjecture that every 
algorithm for this problem that runs in time 

2rw(0)o(iog|0|) necessarily blows up the space 

to exponential in TW{(j)). 

We introduce a novel way to combine the two simple algorithms that allows us to trade 

■ constant factors in the exponents between running time and space. Our technique gives rise 
I to a family of algorithms controlled by two parameters. By fixing one parameter we obtain 

an algorithm that runs in time-space (o*(3i-44i(i-e)rw(0) log |0| -j^ q* (^2^erwi4>)-^^ ^ for every 

Q I < e < 1. We systematically study the limitations of this technique, and show that these 

algorithmic results are the best achievable using this technique. 

We also study further the computational complexity of width parametcrizations of SAT. 
^ ■ We prove non-sparsification lower bounds for formulas of path- width a;(log |0|), and a sep- 

! aration between the complexity of path-width and tree-width parametrized SAT modulo 

OO ■ plausible complexity assumptions. 

m 

00 ■ 1 Introduction 
O . 

Satisfiability (SAT) is the prototypical NP-complete problem extensively studied in theoretical 
and empirical works. Previous work in SAT-solving deals with exact algorithms, special cases, 
heuristics, and parameterizations. In particular, width-parameterizations have received signifi- 
^ ' cant attention. Consider a formula cp in Conjunctive Normal Form (CNF), and also fix a graph 

■ describing its structure. Previous research gave algorithms with running time exponential in 
width parameters, e.g. tree-width, measured on this graph. One reason to care about this is 
because many real-world instances tend to have small width. In this paper we take a further 
step and we study time-space tradeoffs for width-based SAT-solvers. 

In an influential paper Alekhnovitch and Razborov |AR02j gave algorithms that work in 
time in space 

20(TWW)^ where TW{^) is the tree-width of a CNF formula (/>; 
assume that TW{(j)) = ^^(log |(/)|). The authors state their results in terms of the branch- width 
of the formula, which is within a constant factor of the tree-width. They conclude: 

" The first important problem is to overcome the main difficulty of the practical im- 
plementation which is the huge amount of space used by width-based algorithms... 
Thus we ask if one can do anything intelligent in polynomial space to check satisfi- 
ability of formulas with small branch-width? " 
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The question raised by Alekhnovitch and Razborov is a major issue in practical SAT-solving. 
It is well-known in the SAT-solving community that in many common cases SAT-solvers abort 
due to lack of space. 

We devise two baseline algorithms for SAT instances in CNF. These two algorithms will be 
used later on as building blocks of much more involved ones. One is similar to |AR02j . and it 
runs in time 0*(22'7^WW) and space 0*(2'^^W). The other runs in time 0*{3^^'^^^^°^\^\) and 
space |(/)|*^^^^, and it is the first algorithm for deciding arbitrary CNF instances that runs in 
space polynomial and time related exponentially in the tree-width. Unfortunately this does not 
fully answer the jAR02j question since we suffer a log \ factor in the exponent of the running 
time. In fact, our work revolves around this logarithmic factor. 

Conjecture. Let A be an algorithm for SAT that runs in time 0*{2^^'f'^^'-^'f'^^). Consider 
CNF formulas where TWicj)) = a;(log \(p\) and T>V(0) = 0(|(/>|^"'), for arbitrary fixed e < 0. If 
5{4>) = o(log \ then A uses space 2^^'^'^^'^^\ In particular, we cannot achieve simultaneously 
time exponential in the tree-width and space polynomial in the input length. 

Under this conjecture, for all practical purposes it makes sense to devise algorithms that 
improve the constant in the base of the running time and space from 3 and 2 to constants 
smaller than 3 and 2 respectively. At a more systematic level one might want to obtain time- 
space tradeoffs between constants in the exponents of the running time and space. A significant 
part of our contribution regards families of algorithms that achieve such tradeoffs. 

Throughout this paper we assume that the tree (or path) decompositions are given in the 
input. That is, we mod-out the computational difficulty of computing the decomposition. This 
is without loss of generality for our algorithmic results since there are constant approximation 
algorithms for computing such decompositions in time exponential in the tree- width and space 
polynomial in the input length (e.g. |AR02j ). Moreover, from a complexity theory point of view 
this is the "correct" thing to do. In particular, when the width decomposition is given in the 
input, under standard complexity assumptions we show that deciding SAT of a given a tree 
decomposition of width W is harder than deciding SAT of a given path decomposition of the 
same width value W. 

Related work Tree-width is a popular graph parameter introduced by Robertson and Seymour 
[?. IRS86] . The smaller the tree- width of a graph, the more the graph looks like a tree (in some 
topological sense); for a graph of n vertices tree- width 1 means that the graph is a tree, whereas 
treewidth n — 1 means that it is the complete graph. There is a handful of hard computational 
problems on general graphs which become computationally easier when the input graph is of 
small tree-width; see. e.g. [?] for a survey. For SAT instances the tree-width of a CNF formula 
is the tree-width of its associated graph: incidence graph, primal graph, intersection graph and 
so on. Among those graphs, the most general one is the incidence graph (a bipartite graph 
where one side has variable-nodes and the other clause- nodes) . In some sense, the tree- width 
value on the incidence graph upper bounds the tree-width value of the rest |Sze04j . There is a 
vast literature (too large to concisely cite here) in empirical and theoretical studies in various 
width-parameterizations of SAT. 

Improving the constant in the basis of an exponential time algorithm is a well-established 
goal in the field of exact computation for NP-hard problems; see e.g. |Woe03j for a survey on 
problems, algorithmic techniques, and see references within. In particular for fc-SAT there is 
a line of work in algorithms that run in time a" for a < 2; e.g. |PPSZ98l ISch991 IWoe031 ?]. 
Somewhat related to the threshold phenomenon conjectured early in this section, there are 
vertex-ordering NP-hard problems which can be solved in time-space (0*(2"), 0*(2"')) and in 
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time-space (0*(4"),n'^(^)); e.g. [BFK^ll] and references within (an in particular [?]). There is 
no easy way to adapt these technique in our case (and thus to get rid of from the exponent the 
logarithmic factor). A key property of these algorithms is that is that as smaller subproblems 
are created the smaller the parameter (number of nodes) becomes. There is no obvious way to 
achieve this when the parameter is the width of the formula. Compare this to our conjecture. 
Our conjecture is about the width of a SAT instance per se - furthermore, in the worst case 
SAT can be exhaustively solved in time 0*(2") and space n'-^^^\ 

Prior to our work, |GP08j addressed the question of Alekhnovitch and Razborov. The au- 
thors gave a combinatorially non-explicit algorithm only for the A;-SAT problem, where the 
algorithm runs in time 2'-^^'^^^'^^^°^^'^^^ and space |(/)|'^'-^\ Due to the non-explicitness the con- 
stant in the exponent of the running time cannot be bounded in some easy way. 

|Pap09| shows that the complexity of deciding path-width parameterized instances precisely 
corresponds to the streaming verification (in log-space) of NP-witnesses. In particular, it is 
shown that deciding formulas with path decompositions of width O(logn) is complete for NL 
and it is asked whether the complexity of SAT instances with tree decompositions of width 
O(logn) is more difficult. 

Lower bounds for deciding path-width parameterized SAT can be easily derived under the 
Exponential Time Hypothesis (ETH) and the application of the Sparsification Lemma |IPZ98j . 
For the more general case of Constraint Satisfaction Problems, ETH has been applied in techni- 
cally beautiful developments to essentially show that the time-optimal results are the standard 
tree-width based algorithms; see e.g. |Gro071 iMarlO) . 

The last question on the inherent complexity of width-parameterized SAT instances regards 
their sparsification. In a model where a polynomial time verifier is given a formula <j) of path- 
width VW{(j)) and it communicates with an all-powerful oracle, how many bits can the verifier 
send to the oracle to decide (pi This question has been addressed before (e.g. |FS081 IDvMlOj 
- see |DvM10j for references) for NP-hard problems and in particular for SAT. In particular, 
for 3-SAT |DvM10j conditionally shows that if the verifier and the oracle communicate using 
n^"*^ bits, then this is not sufficient to decide satisfiability. 

Our contribution and techniques We give a dynamic programming (DP) algorithm for SAT 
where given a tree decomposition of the incidence graph of width 7'W(<?!>) runs in time-space 
(0*(22'^^W), O* (2^1^ W)), and a recursive algorithm that runs in time-space (O* (S^"*^*-*^^ ^^^), 
|,^|0{i)) (Section [3]). The latter algorithm is the first space-efficient algorithm for width- 
parameterized SAT. In some sense, we are doing even harder work than |AR02] . since the 
underlying graph in that paper is the primal graph. If we combine the DP and the recursive al- 
gorithms in the obvious way, then we gain the worst of both worlds. Here "obvious" means that 
we discretize the space of truth assignments during the execution of the recursive algorithm and 
combine using DP. Instead, we introduce an implicit infinite family of proof systems. We use 
two free parameters to specify an algorithm in this family. One parameter is an integer greater 
than 2. This controls the "complexity" of the rules applied, for performing an unbalanced type 
of recursion of some sort. The larger this parameter is the more space and the smaller the 
running time is. The second parameter is a real number in (0, 1) that controls the discretization 
of the truth assignment space. This family of algorithms is presented in Section 21 In the same 
section we show that all infinite pairs of values are of interest depending on different time-space 
bounds one may want to achieve. 

Section [5] contains some preliminary complexity theory results for width-par ameterizations. 
We show that the problem SATtw, where the CNF formula is given together with the tree 
decomposition is computationally harder than the problem SATp^ where the CNF formula 
is given with a path decomposition of the same value. In particular, SATtw for tree-width 
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0(log|(/)|) is harder than SATp^ of path-width 0(log|(/>|), unless NL = SAC^, a standard 
complexity assumption (e.g. |BCD"'"89] ). Note that this is not true in general for other width 
parameters. For example, although path-width and band-width combinatorially may be off 
by an exponential, under log-space transformations they behave the same |GP08j . We also 
show that there is no trivial way to sparsify SATtw unless a scaled and non-uniform version of 
NP / coNP fails. 

2 Preliminaries 

We introduce notation, terminology, and conventions used throughout the paper. We also 
provide a rather elementary introduction on how an algorithm may exploit the structure of 
bounded tree- width formulas. 

2.1 Notation 

All logarithms are of base 2, and all propositional formulas are in Conjuctive Normal Form 
(CNF). SAT is the decision problem where given an arbitrary CNF formula we want to decide 
if it is satisfiable. A;-SAT denotes the restriction of SAT to CNFs where each clause has at 
most k literals. For a formula (p, m denotes the number of clauses, n the number of variables, 
and Cj and Xj stand for the i-th clause and j-th variable respectively. For convenience we write 
\(f)\ = m + n. The notation O*, 0,* and 0* suppresses polynomial factors. 

2.2 Tree-Width 

Definition 1. Let G = {V,E) he an undirected graph. A tree decomposition of G is a tuple 
(T, X), where T = (VF, F) is a tree, and X = {Xi, • • • , where Xi <^V s.t. 

(1) uHx. = V 

(2) yii,j)eE, 3teW, s.t. iJeXt. 

(3) \/i, the set {t : i G Xt} forms a subtree ofT. 

each of Xi is called a bag, the width of (T, X) is defined as maxt^wl^tl — 1? o.nd the tree-width 
TW{G) of graph G is defined as the minimum width over all possible tree decompositions. 

When the tree decomposition T = {W, F) is restricted to a path, the decomposition is called 
path decomposition^ and the specific tree- width is called path- width 'PW(G). The following 
inequality holds ( |Bod98j ) 

TW{G) < VWiG) < 0{log\V\TWiG)) 

Definition 2. The incidence graph G^j, of a SAT instance (j) is a bipartite graph, where in 
one side of the bipartization each node is associated with a distinct unsigned variable, and in 
the other each node is associated with a clause. There is an edge between a clause-node and a 
variable-node if and only if the variable appears in a literal of the clause. The tree-width of a 
formula (j) is the tree-width of its incidence graph, TW{(j)) = TW{G^). When it is clear from the 
context we may abuse notation and write TW((/>) to denote the width of a given decomposition 
of G^. 

We assume that a tree decomposition of the incidence graph of (j) is given as input along with 
<j). For convenience, we assume the input tree decompositions have the following two properties. 
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(1) \w\ = o{rw{^) ■ \v\) = o{rw{m\) 



(2) The tree T has bounded degree 3. 

Tree decompositions satisfying the two properties are cahed nice. A tree decomposition can 
be converted to a nice one in Hnear time ( [K"lo94] [Bod98j ) . Notation d is used to denote the 
maximal degree in the tree decomposition. By the property above, d < 3. When the input is 
given with a path decomposition, d is actually upper bounded by 2. 

Remark 1. The parameter d affects the performance of our algorithm significantly, to fully 
exploit the structure of the input decomposition, we prove most of our results parameterized by 
d. One may replace it by 2 or 3 when the structure of the input decomposition is guaranteed to 
be a path or a tree. 

2.3 Truth assignments, assignments, and tree decompositions 

The structure of a tree decomposition is associated with the concept of separability (e.g. |Bod98j ). 
Intuitively, the smaller the tree- width is, the easier the graph can be broken into separate compo- 
nents by removing nodes. It is the separability that allows us to device more efficient algorithms 
for small tree-width SAT, than for general SAT. In some sense, the given tree decomposition 
allows us to "localize" the exhaustive search. The following example sheds some light on how 
this can be done. For the sake of simplicity, we make an additional assumption on the tree 
decompositions given in the input, that all the variables of a clause appear in the same bag 
with the clauses. We will see later that removing this assumption is non-trivial. 

Suppose Xj's, x'j's and s are different sets of variables, and the tree decomposition is as 
in Figure [Tal Some clauses depending only x^'s or only x^"s are not drawn explicitly but are 
placed in the bags as indicated. 




other clauses on x'-'s other clauses on x';"s other clauses on xj's other clauses on x';"s 

(a) Input tree decomposition. (b) Fixing an assignment to the variables in 

the middle bag results in two independent 
instances. 



Figure 1: An example showing bounded tree- width SAT can be solved efficiently 

Suppose that we fix a truth assignment to the variables in the bag in the middle, e.g. 
xi = X2 = X3 = X4 = 1. Conditioned on this truth assignment, we can simplify the instance by 
removing clauses that are already satisfied, and removing literals in a clause which are set to 
false. This will result-in multiple subproblems as shown in Figure [Tbl Assured by the property 
of a tree decomposition, the subproblems depend on different set of variables, i.e. they are 
independent. Since if instead they shared a common variable, this variable must also wuld have 
appeared in the middle bag, e.g. X2. At this point this variable must have been fixed to a truth 
assignment, thus removed in the simplification procedure. 
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The satisfiability of the input instance, conditioned on the truth assignment given to the 
middle bag, is determined by the satisfiability of the two separate subproblems. Therefore, it 
suffices to enumerate all truth assignments satisfying all the clauses in the middle bag without 
causing empty clauses in the simplification phase. Then, solve the two resulting subproblems 
separately to decide the satisfiability of the original instance. Furthermore, this "splitting" 
operation can be invoked recursively, by carefully choosing the "middle" bag. 

In each recursive step, the most time-consuming part is to enumerate all the assignments 
satisfying all the clauses in the chosen bag, which costs 0*{2^'^^'^^^"^^'^^) time, and the total 
running time is 0*(2^^('^)^°§l'^l), which is much better than the currently best algorithms 
for general SAT which run in the exponential in n. The algorithm described above will be 
formalized as the space-efficient algorithm in Section [3.41 

The subtle additional assumption The assumption that all variables of a clause appear in the 
same bag with the clause is not a mild one (especially for CNFs of large cardinality). In general, 
we may have to delay the decision to satisfy a clause. In the above algorithm, we only store 
the truth assignments to the variables. The following example shows that only storing this 
information is not enough, when aiming at removing the assumption. 




(a) 4>i (b) (1)2 (c) (p3 



Figure 2: Three instances used in the example. Figures on the top are the input tree decompo- 
sitions, the bottom figures are the two components after fixing assignment to the variables in 
the middle bag. 

Suppose Ci = xi V a;2 V X4 V xq, C2 = xi V X3 V X5, C3 = X2, C4 = X3, C5 = X4, Cg = X5 
and C7 = Tq. Three instances (pi, (p2 and (/>3 along with their tree decompositions are given 
in Figure El where (/>i = Ci A • • • A C7, </)2 = Ci A • • • A C5 A C7(i.e. Cq is missing), and 
(/)3 = Ci A • • • A C4 A Cg A (77(1. e. C5 is missing). We say that a clause is satisfied by a literal 
under a truth assignment if the literal appears in the clause and is set to 1. If an instance is 
satisfiable, then there is a truth assignment where every clause is satisfied by one of its literals. 

Now, consider the splitting operation on the middle bag by fixing a truth assignment to it 
as above. For all three instances, the only possible assignment for xg is 0, since C7 must be 
satisfied by xg = 0. Similarly, in the left bag, we must assign X2 = and X3 = to satisfy C3 
and C4. In the left bag, the only variable left is xi, which can satisfy either Ci or C2, but not 
both. The three instances differ in the right part, where two variables X4 and X5 are left. 

Satisfying C5 requires X4 = 0, then Ci can not be satisfied by X4. Similarly, satisfying 
Cg requires X5 = 0, then C2 can not be satisfied by X5. In order to find a satisfying truth 
assignment, when processing the right part, we need the information which of Ci, C2 is already 
satisfied in the left part. 0i is not satisfiable, so whichever does not affect the result. (^2 is 
satisfied only when Ci is already satisfied, while (^3 is satisfied only when C2 is already satisfied. 
This piece of information is not carried through the middle bag by just the truth assignment to 
the variables. 

To overcome this issue we are going to use "clause-bits". In fact, the semantics of these bits 
is a non-obvious issue which significantly affects the running time of our algorithms. 
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Notation and terminology We introduce terminology and notation to talk about truth assign- 
ment on bags. Let X be a bag in the tree decomposition, V be the variables and C be the clauses 
appear in X. Also, ny = |V| and mc = \C\. An assignment Rx for X is a binary vector of 
length n\; + mc. The first ny bits indicate the truth values of the corresponding variables. Note 
that the term "assignment" does not correspond only to a "truth assignment" on the variables 
in X. It is an assignment of bit values both to variables and to clauses. 

What values the last mc bits have is a subtle issue explained in Section [3l For the first, 
dynamic programming algorithm, things are pretty clear. However, for the space-efficient and 
trade-off algorithms, things become more subtle. Intuitively, a bit corresponding to a clause C 
is 1 if we "have decided" to satisfy this clause (this has to do at which part of the execution of 
the algorithm we are), and it is different for different algorithms. 

Actually, the most straightforward way of defining the clause bits is to let it denote whether 
the corresponding clause "is" satisfied. To ensure that a clause is satisfied in one of the branches 
in the tree decomposition, we need to enumerate all 2'^ — 1 combinations that on which branches 
the clause is satisfied. However, if one is interested in only in the satisfiability problem (and 
not e.g. in #SAT) we observe that only d combinations can do the job. 

3 Basic algorithmic results 

We give the two basic algorithms. These serve as building blocks for the algorithms in Section 
m The first baseline algorithm (Section 13. ip is doing dynamic programming, and it runs simul- 
taneously in time-space (0*(22'^^W),0*(2'^^W)). The way this algorithm goes is standard 
in the literature of algorithms that compute using a tree decomposition [?]. The second algo- 
rithm is recursive and it runs in time-space 

(|Q*(3rw(<A)iog|0|)^ \^\0(i)y Before presenting this 
space-efficient algorithm we introduce more terminology and lemmas (Sections 13.21 13. 3p dealing 
with truth assignments on tree-decompositions. This level of generality is not necessary if it 
is only used for the space-efficient algorithm. Full use of this generality is made in Section S] 
where we give the time-space tradeoff algorithms. 

3.1 Time-efficient algorithm 

A binary array satisfiability[-, •] indexed by X and Rx is defined, where X is the root of the 
subtree in the tree decomposition, Rx is an assignment to X. satisfiability[X, = 1 means 
that there exists a satisfying assignment to the subtree rooted at X, such that the assignment 
to X is Rx- 

The values of the array can be computed in a bottom-up fashion. When computing values 
for a bag X, let Xj's be its children, satisfiability[X, Rx] is set to 1, if there exist -Rx/s consistent 
with Rx such that satisfiability[Xj, = l,Vz. Consistent means: bits corresponding to the 
same variable in Rx and in i?Xi's are the same; if a bit corresponding to a clause C in Rx is 
assigned 0, or is assigned 1 and C is satisfied by a variable in X, the bits in all Rxi's for C are 
set to 0, otherwise, the bits in all -Rx/s for C are set to 1. Note that the notion of consistency 
here is different than the notion introduced in the next section. 

Suppose Xr is the root of the tree decomposition, if satisfiability[X,., = 1 some 
RxrJ then the instance is satisfiable. This can be proved by induction on depth of the tree , 
together with the construction of the satisfiability table and the property of a tree decomposition, 
satisfiability table requires O* {2^'^^'^''') space. Filling the entries as described above requires 
0*[2^d-i)TWW^ time, where d is the maximum degree in the tree decomposition. Recall that 
we have assumed a normal form on the tre decomposition where d = 3. 
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3.2 Splitting Node 



The operation of splitting the tree at a node is an essential step for the space-efficient and 
tradeoff algorithms. 

Definition 3 (splitting operation). Let T = {V,E) be a tree, and v gV. Splitting T at f is the 
following operation. Let Ti, . . . ,Tk be the trees after removing v from T. The splitting operation 
results-in a forest {v} U Ti, . . . , {v} U T^, where {v} U Tj is the subtree induced by the nodes in 
Ti together with v. We call v the splitting node of this operation. 

The node at which we split a tree is labelled as a splitting node. Given a tree T together 
with a sequence of splitting operations results-in a forest where each subtree in the forest in 
general has many splitting nodes. 




Figure 3: Two splitting operations at the black shaded nodes 

The following Lemma [T] is somewhat reminiscent to the well-known "5 - | lemma" for 
binary trees. Lemma [T] together with Corollary[Tl ensures that it is possible to efficiently choose 
a balancing splitting node in a tree of constant degree; i.e. a splitting node where the sizes of 
the trees in the resulted forest is linear in the number of nodes of the tree. 

Lemma 1. Consider a tree of size N, a leaf s and < a < 1. Then, there is a node p, where 
one of the trees containing s resulted after splitting at p is of size < \aN~\ and each of the rest 
is of size < \{1 — a)N~\ . p is called an a-splitting node. Furthermore, such a p can be found in 
time polynomial in N. 

Proof. Here is an algorithm for finding p. Root the given tree at s and construct a path 
(s = vi,V2, . . . ,vi) as follows. At step i, among the children of -yj-i let vi be the root of the 
largest subtree. We claim the there in vj in this path with the desired properties. 

Denote by the size of the subtree containing s after splitting at Vi. It is obvious to see 
that ai = 1, ai = N, and strictly increases as i increases. Therefore, there must be a j, such 
that, Oj < aN and Oj+i > aN. We claim that Vj is the node we need. If flj+i — Oj = 1, then 
cutting Vj will result in two components, where the size of the component containing s is \aN~\ , 
while the other one is of size [(1 — a)N~\ . If aj+i — aj > 1, then there must be a branch at vj, 
meaning that Vj has at least two children. Splitting at Vj results-in at least three components, 
the one containing s is smaller than aN, and the largest one among the others is smaller than 
(l-a)iV. □ 

Corollary 1. On a bounded- degree tree of size N, there exists a nodep, such that after splitting 
at p each subtree is of size at most . 
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3.3 Splitting nodes and assignments 

Consider a tree decomposition and a sequence of splitting operations. This process breaks the 
original tree to a forest where each subtree ha its splitting nodes. We refer to an assignment 
on a subtree as the assignment that corresponds only to its splitting nodes. Let T be a subtree 
with splitting nodes S. For a tree T with splitting nodes S, splitting at a node p results in 
multiple subtrees {71} , each Ti with its splitting nodes Si. Denote by X* = U^^g^Xj, and let V 
be the variables and C the clauses which appear in X*; this is the set of variables and clauses 
on which we define assignments. Suppose Rj- is an assignment to T, and Rj-. is an assignment 
to the subtree 71. Rt and all the i?7-'s are said to be consistent if 

(1) for every i, the bits corresponding to a variable x in Rj;. is the same as in Rj- 

(2) for a clause C, 

a) if C appears in X* and is assigned 0, then Vi every bit for C in Rj-^ is assigned 

b) otherwise, 3 exactly one i such that in Rj;. the bit corresponding to C is assigned 1. 

Remark 2. The latter point in the definition, where in exactly one of the subtrees we require 
that the corresponding bit equals to 1, is somewhat subtle. Note that it has a significant effect 
in the running time of the algorithms. The following lemmas crucially depend on this issue. 

The following two lemmas upper bound the number of assignments in two different situa- 
tions. In what follows we assume that there is an initial tree decomposition together with a 
sequence of splitting operations that result-in the subtrees along with their splitting nodes. 

Lemma 2. For a tree T with splitting nodes S, the number of assignments is at most 2^^^''^^'^h 

Proof. \X*\ < Ylvi^s — l'S'|'7'W((^), the number of variables and clauses in X* are at most 
\S\7'yV{(f>). So the number of assignment is at most □ 

Lemma 3. For every assignment TZj- to the tree T , the number of assignments R-j-. to subtrees 
7i 's consistent with TZj- is at most d"^^^^ . 

Proof. Let Xp be the bag corresponding to the splitting node p. For each variable x in the bag 
Xp, there are 2 possible assignments of x in the subtrees {Ti}. For each clause C in Xp, if C 
appears in Rj- and is assigned 0, by the definition of consistency, each appearance of C in the 
{7i}'s is assigned 0. Otherwise, in exactly one {%} C is assigned to 1; in this case there are at 
most d valid assignments. □ 

Definition 4. For a tree T with splitting nodes S, an assignment Rj- is satisfying if there 
exists a truth assignment A to every variable in 7' , such that 

(1) every truth value for a variable in Rj- agrees with the corresponding value in A. 

(2) every clause C that appears in T where C does not appear in S, is satisfied by A. 

(3) every clause C that appears in S and is assigned 1 by Rj- is such that C is satisfied by A. 

The following lemma shows how to determine the satisfiability of an assignment recursively. 

Lemma 4. An assignment Rj- is satisfying if and only if there exist assignments R-ji to the 
subtrees %, such that the assignments Rj: are consistent with Rj- and each of the Rj: are 
satisfying. 
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Proof. For a tree T with splitting nodes S, suppose that sphtting at node p results-in the several 
subtrees {%}■ 

Suppose that the assignment Rj- is satisfying, by Definition [U there exists a truth assign- 
ment on variables within T. Using the truth assignment, we can always find assignments R-ji 
consistent with R-j-, such that for these truth assignments the conditions in Definition H] are 
met. 

For the other direction suppose that there exist assignments Rji of the subtrees 71, such 
that the assignments R-ji are consistent with Rj- and all RJ^ are satisfiable. For each subtree 
7i, there exists a truth assignment complying to Definition [H Since all these truth assignments 
agree with on their common variables, we can get a truth assignment from their union, which 
also meets the axioms in Definition HI Therefore, the assignment Rj- is satisfiable. □ 

3.4 Space-efficient algorithm 

The space efficient algorithm is based on the observation that in every degree-bounded tree we 
can always find a node, such that by splitting at that node every subtree has size no more than 
half of the original tree (Corollary [T]). 

Then, the recursive algorithm works as follows: find the splitting node, fix the assignments 
for all subtrees, and then recurse on the subtrees after the splitting. The algorithm is summa- 
rized in Algortihm [TJ T is a tree with previous splitting nodes S, and Rj- is the assignment 
fixed on the tree. A subtle point that affects the running time of this algorithm is addressed in 
Remark [2j 

Algorithm 1 SAT(r, i?r) 
1: if every nodes in T are previous splitting nodes then 

2: if every clause in Rj- which assigned 1 is satisfied by some variables in T then 
3: return true 
4: else 

5: return false 
6: end if 
7: else 

8: find the splitting node s, and cut at s, which result in many subtrees Tj 

9: for all assignments Rj: consistent with Rj- do 
10: if for each subtree %, SAT(7i, R%) = true then 
11: return true 

12: end if 
13: end for 
14: return false 
15: end if 



Regarding its correctness, after the splitting, by the property of a tree decomposition, all 
the variables shared by different components must have been fixed at the splitting node, namely 
there will be no consistency problem among the components. By induction, it can be shown that 
SAT{T',A) corresponds to the satisfiability of the (sub)tree decomposition T consistent with 
all assignments A. Therefore, SAT{To,^), where To is the input tree decomposition, corresponds 
to the satisfiability of the instance. 

This algorithm requires only |</>P^"^^ space, because there are only 0(log \<j)\) assignments to 
be stored during the process. The running time can be written as follows. Suppose T(N) is the 
running time on a decomposition with N nodes. By Lemma [3] 
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T{N) < O (d-^^*)) T i^^Nj + 
that is, = 0*(d^^^''^'*'°sl'^l), where by the normal form assumption d = 3, i.e. = 

4 Algorithms for time-space tradeoffs 

If we combine the baseUne algorithms of the previous section in the obvious way by considering 
blocks of truth assignments, we obtain the worse of the two worlds (do you see why?). In 
this section we establish Theorem [1] below, by exhibiting a family of algorithms that achieve 
time-space tradeoffs. To that end, we introduce a new algorithmic technique in which we make 
non-black-box use of the two simple algorithms. Each algorithm in this family is identified 
by two parameters (e, c). Moreover, we show that both of these parameters are necessary to 
achieve different time-space tradeoffs. Parameter < e < 1 corresponds to the granularity of 
the discretization of the assignment space, whereas the integer parameter c > 2 has to do with 
the "complexity" of the rule applied recursively. 

Theorem 1. For every integer c > 2 and e, where < e < 1, a SAT instance (j) with a tree de- 
composition of width TyV{(p) and N nodes, can be decided in time O* (3{-'*c(iog^-c)+c)(i-e)T>V{0)^ 

and in space O* (2'^'^''"^'^'^)) for a constant Ac- 

Ac is a constant depending on c. To be more specific, Ac is defined as — logXc, where Xc is 
the root with largest absolute value of the polynomial equation: X'^ — X'^~^ — X'^~'^ — ••• — 1 = 0. 
Values of Ac for small c's are listed in Tabled] 



c 


2 


3 


4 


5 


6 


Ac 


1.441 


1.138 


1.057 


1.026 


1.013 



Table 1: Ac for small c's 



4.1 Splitting Depth 

A splitting algorithm ^ on a tree, computes a function where given a tree T together with 
previous splitting nodes S, it returns a node where the next splitting operation is going to be 
performed. 

Definition 5. c-splitting depth SDc(^, T, S) of a splitting algorithm A on tree T with previous 
splitting nodes S is inductively defined as follows: 

( max(7-„,5„)gCr^^SDc(Aro,5o) + l \S\ < c,\S\ < \T\ 
SDM,T,S) = \ |S| <c,|S| = |r| 

[ oo |S| > c 

where p is the output of A on T and S, C-j-,s,p is the set of subtrees by splitting at p in tree T 
with previous splitting nodes S 

Intuitively, splitting depth is the recursion depth of the splitting algorithm, c-minimal 
splitting depth MSDc{T, S) is the minimum value of SDc{A, 7~, 5), over all splitting algorithms. 
The case of oo is for well-definiteness. 
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4.2 Assignment Group 

For a tree T with previous splitting at nodes 5, splitting a node p results in several subtrees {71}, 
each 7i with splitting nodes Si. By Lemma EJ there are at most 

2|5|rw(<^) different assignments. 

An e-assignment group (e-GR-r) is a set of binary strings of length at most |S'|7'W(</>), in which 
(1 — e)TyV{(p) entries corresponding to each previous splitting node are fixed to some constant 
value. 

Consider the case of a tree T with previous splitting nodes S. Suppose S contains 4 variables 
xi, X2, X3, X4 and 3 clauses Ci, C2, C3. An example of e-GRj- is as follows : xi, X4, C2, C3 are 
fixed to 1,1,0,1, and X2, x^, Ci are not fixed. The e-GRj- contains 8 = 2'^ binary strings. 

e-GRj- and e-GR-ji 's are said consistent if and only if there exists a way to assign a value to 
each of the unfixed values in T(as Rq-) and 7i's(as -Rt^'s), such that R-j- and R^s are consistent. 

For a tree T and e-GR-j-, fix (1 — e)7'W(</>) bits correspond to variables and clauses contained 
in the splitting node p, one can derive e-GR-j-. for each subtree Ti- Note that the fixed entries 
for the splitting node p may be different among subtrees, and the unfixed entries in T need 
not to be fixed in subtrees. For the number of different combinations, the following important 
lemma holds. 

Lemma 5. The number of distinct of e-GRj-^ 's consistent with e-GRj- is at most d^^~^^''^^'^\ 

Proof. For each variable x, there are 2(< d) possible values. For each clause G, let do{< d) be 
the number of subtrees created by splitting at p. There are three different cases. 

Suppose that G does not appear in any previous splitting node. This implies that G only 
appears in T, then there are do possible way of assigning values to the bit for G, s.t. exactly 
one of Ti's whose bit for G is set to 1. 

Suppose that G appears in some previous splitting nodes, and its value is fixed in c-GRt- 
If the bit for G is assigned 1, then there are do possible assignments to G similar as above, 
otherwise, the only possible way is to set all bits for G to 0. 

Suppose that G appears in some previous splitting nodes, but its value is unfixed. G must 
appear as unfixed in at least one subtree. Without loss of generality, we assume that G appears 
in subtrees 71, Ti • • • T^o, where cq > 1. The values of C in 71 , Ti • • • T^o ^^^^^ unfixed, so there 
are do — cq + 1 < do possible assignments of G in the subtrees T^q+i, • • • , T^q, the first one sets 
all to 0, and the i{> 2)-th one sets the bit of G in the subtree Ti+eo-i to 1 and the rest to 0. 

Since there are at most (1 — e)TW{(f)) unfixed values in p, the number of different combina- 
tions of e-GRj-- consistent with e-GRj- is at most d^^~''^^^^'^^ □ 

4.3 Warm-up: a tradeoff algorithm for a given path decomposition 

Here we are dealing with the simpler case where we are given a path decomposition of the 
incidence graph of a formula, together with the formula (the main complication occurs for tree 
decompositions). Algorithm [2] depicts the framework of a tradeoff algorithm, where T is a path 
decomposition, 5 is a set of previous splitting nodes, e-GRj- is an assignment for T with S. 
This family of algorithms is parameterized with (i) e and (ii) the splitting algorithm in linellUI 

Let VW{4') be the width of the given decomposition. The splitting algorithm chooses the 
node in the center of the corresponding path segment such that the splitting operation results- in 
two almost same-length segments. 

Each segment will contain at most 2 previous splitting nodes throughout this procedure. By 
Lemma [31 the number of assignments in each segment is at most 2'^^'^^'^\ Now, we consider a 
e-GRj^ of T. There are at most 2^'^^^('^) assignments Rj- G e-GRf- Number them from 1 to 
\e-GR-r\. Denote by MiT, e-GR-r) ^ 2'^^^'^^'^^ array, where the i-th entry indicates whether the 
i-th assignments of e-GRj- can be satisfied. M{T ^e-GRq-) can be computed by Algorithmic 
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This algorithm has structure similar to the recursive algorithm, but instead of fixing an 
assignment of two sub-segments it fixes only (1 — €)VW{4>) bits and recurses on the two com- 
ponents with two e-assignment groups. Intermediate results are stored to save time as in the 
dynamic-programming algorithm. 

Algorithm 2 SAT-hybrid (T, 5, e-GRr) 

1 
2 
3 
4 
5 
6 
7: 
8 
9 

10 



M{T, e-GR-j-) ^ all zero matrix 
if all nodes in T are previous splitting nodes then 
for j ^ 1 to \e-GPr\ do 

Let Rj- be the jth assignment in e-GRj- 

if all entries for a clause in Rj- assigned 1 are satisfied by some variables in T then 

M{T, e-GRr) j ^ 1 
end if 
end for 
else 



Split at the node returned by a splitting algorithm given T, S 



11: Denote the subtrees after the splitting as TI's 

12: for all e-group assignments e-GR-jr Vi, which are all consistent with e-GRj- by fixing 

(1 - e)r>V((/)) entries do 

13: Vi , M{Ti, e-GRr ) ^ SAT-hybrid(P, %, e-GR^ ) 

14: for j ^ 1 to \e-GPr\ do 

15: Let Rj- be the jth. assignment in e-GRj- 

16: for all Rr € e-GR^, Vi do 

17: if M{Ti,RTi) — 1'^^ ajiid RTi^ all consistent with Rj- then 

18: M(r, e-GRr) j ^ 1 

19: end if 

20: end for 

21: end for 

22: end for 

23: end if 

24: return M(T, e-GRr) 



The algorithm will recurse log|(^| times. In each recursive step, we only need 0(2^'^^^(''^)) 
bits to store the array M. Hence, we use space Q* (^2'^^^^(^)'J, At every step of the recur- 
sion, we need to call the algorithm recursively 0(2^^"'^)^^'^'^)) times. So, the running time is 

The correctness follows from the correctness of the recursive algorithm given in Section 13.41 
Basically, this algorithm enumerates all the assignments as in the recursive algorithm except 
that some intermediate results are stored to reduce running time. 

4.4 A tradeoff algorithm for a given tree decomposition 

The tradeoff algorithm for the given tree decomposition is again Algorithm [21 where we imple- 
ment line [10] differently than in the case of a given path decomposition. 

The main technical complication. The idea of splitting at the center node as in the previous 
section does not work any more. Such splittings may create subtrees with more than 2 previous 
splitting nodes. In fact, the way we dealt with this in Section [3.41 was not to do anything. But 
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Figure 4: Finding the splitting node in three different cases. 



now, we have to deal with assignment groups and thus the space requirement may be as bad 
as i}*[2'^'''^'^^^''), where c is the maximum number of previous splitting nodes in the recursion. 
Overcoming this issue requires a new idea. Implementing this idea results-in an algorithm 
with the property that throughout its execution the number of splitting nodes in the recursive 
procedure is < 2, or more generally a constant. 

Let a be a parameter satisfying < a < 1/2. A type£ tree is defined to be a tree with i 
previous splitting nodes. Here is a splitting algorithm T-L2 satisfying the restriction mentioned 
above. This specific splitting algorithm is an implementation of line [10] (i.e. replace the box in 
line [To] with Algorithm [3]) . 

Algorithm 3 Splitting algorithm ^2 with T, S as parameters 

1: if T with 5 is a typeo tree then 

2: return the 1/2-splitting node 

3: else if T with 5 is a typei tree then 

4: consider the previous splitting node as the root 

5: return the a-splitting node 

6: else if T with 5 is a type2 tree then 

7; suppose the two splitting nodes are pi and p2 

8; consider pi as the root and compute the 1/2-splitting node m 

9: if m is on the path between pi and p2 then 
10: return m 
11: else 

12: return the least common ancestor c of m and p2 
13: end if 
14: end if 



Putting everything together Utilizing this splitting algorithm, we obtain an algorithm for tree 
decompositions. Denote by Ti{N), T2{N) the running time of ^2 on typei or type2 tree each 
of N nodes respectively. 

Splitting a typei tree will result in multiple typei trees with size at most (1 — a)N and one 
type2 tree with size at most aN, so we have 

Ti{N) < 0((i(i~^)'^^('^))(ri((l-a)iV)+r2(aiV)) + 2^C^^('^)) 

Splitting a type2 tree, when the 1/2-splitting is on the path between pi and p2 will result in two 
type2 trees with size at most A^/2 and multiple typei trees. Otherwise, the splitting operation 
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will result in two type2 trees with size at most N/2 and several typei trees. So, we have 

T2{N) < 0(d(^-^)'^^('^))(ri(iV) + r2(Af/2)) + 2°C^^W) 

By solving these recurrences, the running time and space of the hybrid algorithm can be 
summarized as follows. 

Theorem 2. SAT of tree-width TWi<p) can be solved in simultaneously 0*(di-44i(i-^)r>V(<}i)iog7V^ 
time and O* {2'^^'^^^'^^) space, where e is a free parameter, < e < 1. 

Proof. Set a = — we have 

Ti{N) < 0(d(i-^)'^(<^))(ri((l-a)iV) + r2(aiV)) + 2°('^^W) 

< 0(d(i-^)^>^W)ri ((1 - a)N) + 0(d2(i-^)^W)Ti (aN) + 20(^^W) 

Therefore, we have 

Since typci, i > 3 trees are not allowed, the space requirement is 0*{2'^''^'^^^'') □ 

Optimality of the splitting procedures The splitting algorithm presented above is a specific one 
without creating typei,\/i > 3 trees. Actually, it can be shown that this specific splitting 
algorithm is optimal modulo our technique. 

Definition 6. Denote by 2tc (Vc >2) the family of algorithms for SAT with bounded tree-width 
following the framework in Algorithm which use a splitting algorithm without creating typei 
trees Mi > c. 

We lower bound the running time of all algorithms in %2- The hard instance comes from 
fibonacci trees. 

Definition 7. For any positive integer h, a /i- fibonacci tiee (denoted as F^) is recursively defined 
as following, 

(1) if h = l, Fh contains only 1 node; 

(2) if h = 2, Fh contains 2 nodes and one edge between them; 

(3) if h> 2, Fh is constructed by a root connecting two subtrees Fh-2 and Fh-i- 

An extended (/i, r)-fibonacci tree (denote as F^^) is constructed by adding one edge between the 
root r and the root of subtree F^ ■ 




Figure 5: An h- fibonacci tree (F^). 

An /i-fibonacci tree is indeed the worst-case for splitting algorithms without creating typci 
trees > 3. Formally, we have the following lemma. 



15 



Figure 6: Illustrations of two cases for the proof of Lemma [TJ 



Lemma 6. For each h > 1, in an extended (/i, r)-fibonacci tree, suppose r is a splitting node, 
let N he the size of the tree. Then every splitting algorithm which does not create typci trees 
Vi > 3 runs in Jl*(di-44i(i-.)r>V(^)iog7V) ^-^^^ 

Before getting into the proof, we define two special typei and type2 trees : 71, h and 72,h- A 
Ti^h tree is constructed by a splitting node connected to a subtree Fh, and a Ti,^ tree constructed 
by two splitting nodes connected to another node which has a subtree F^. 

Claim 1. The lower bound of processing time for tree Ti^h is Q* (d^^~^^'^'^^^^'^) , and for tree 
T2,h, the lower bound is Q*(d(i-^)'r>V(<^)(/i+i))_ 

Proof. We proceed by induction. Base cases are trivial where h <2. Suppose the statement is 
correct for any ho < h. For tree 71, by inductive hypothesis, if we split at the root of Fh, the 
processing time is at least if we split at some node inside the subtrees 

Fh-i or Fh-2, the processing time is at least ^* {^S-^~'''^'^'^^'^'^^^^^^~'^'^^^^) . So the lower bound for 
tree T{^h is {d^^~'''^'^^'^'^^^) . For tree 72, we must split at the node connecting two splitting 
nodes, so again by inductive hypothesis the lower bound is Q,* {2^^~'^)'^^^'t>)^^+^)) . □ 

Proof. (Proof of Lemma [6]) Set a = , since the number of nodes in F^ is Q 
we have a lower bound for running time $7* ^(i-i°g(i-°=) ^""^ e)T>V{</>)logAf^ ^ |— I 

Combining the above lemma and the algorithmic result, the following theorem can be proved, 
which states that our algorithm is optimal within 2I2. 

Theorem 3. For fixed e, < e < 1, the running time of an optimal algorithm in 2I2 is 
Q* ^^lA41{l-t)TW(<f>) logN\ 




4.5 Generalized tradeoff algorithm for a given tree decomposition 

It is natural to ask if the algorithm can be generalized to allow up to typcc trees for some c > 3. 
Indeed, as c increases the running time decreases while the space requirement increases. If we 
want the tradeoff to make sense, the parameter e must be restricted to some specific range. 

First, we need to generalize the splitting algorithm to allow typei trees for i up to c. For 
arbitrary 1 < i < c, consider splitting a typci tree: suppose the splitting node is p. If p is 
on the path between some pair of previous splitting nodes, splitting at it will result-in several 
typej{j < i) trees, otherwise, splitting will result-in several typei trees and one typei+i tree. 
Formally, when splitting a typci tree, we invoke algorithm Tic to determine the splitting node, 
which can be seen as an implementation of line [TO] in Algorithm [2j 
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Figure 7: Finding the splitting node in three different cases. 



Algorithm 4 Splitting algorithm T-Lc with T, 5 as parameters 



1: if T with 5 is a typcQ tree then 

2: return the 1/2-splitting node 

3: else 

4: suppose T with 5 is a typei tree 

5: if the number of nodes in T is less than 2^~* then 

6: return the 1/2-splitting node 

7; else 

8: arbitrarily pick a previous splitting node as root 

9: compute a ac,i-splitting node qi 

10: if qi is not on the path between any pair of previous splitting nodes then 

11: return qi 

12: else 

13: compute a 1/2-splitting node q2- 

14: if q2 is not on the path between any pair of previous splitting nodes then 

15: return the least common ancestor of q2 and all previous cutting nodes 

16: else 

17: return q2 

18: end if 

19: end if 

20: end if 

21: end if 



Each ac,i for any 1 < z < c is a parameter satisfying < ac,j < 1/2. To prevent typec+i 
trees, splitting nodes of typcc trees must be on the path between some pair of existing splitting 
nodes, this is assured by setting ac,c = 0. For a fixed c, the running time and space of the 
algorithm solving SAT of bounded tree- width utilizing the splitting algorithm A are summarized 
in Theorem [T]^see page [TT]) . The following lemmas are the building blocks used to conclude the 
theorem. 

Lemma 7. For every c > 2, tree T with N nodes and splitting nodes S, let Dc |5|(A^) = 
max7-{SDc(T)}. Then for each 1 < i < c, 

D.^N) < max{D,^i ((1 - ac,i)N) , D,^,+i {a^^iN) , D,^, {N/2)} + 1 

and 

DcAN) < max{L>,,i(iV),Z),,c(iV/2)} + l 
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Proof. Without loss of generality, suppose N > 2'^. Consider splitting a typei tree with splitting 
nodes S, 1 < i < c. If the ac, j-splitting-node m is not on the path between any pair of previous 
splitting nodes, splitting at m will result in multiple typei trees of size at most \{l — ac,i)N~\ and 
one typci^i tree of size at most [ac,j-^l- Otherwise, since 1 — ac,t > 1/2, the maximal possible 
size of a typei tree created by any splitting node will not exceed [(1 — ac,i)N~\. Splitting at the 
1/2-splitting-node c will result in multiple typej{j < i) trees of size at most otherwise, 
splitting at the least common ancestor of c and all previous splitting nodes as p, will result in 
multiple typei tree of size at most [(1 — occ,i)NA^ and many typej{j < i) trees with size at most 
. In summary, 

DcAN) < max{L>,,i ((1 - ac,^)N) , 1)^,^+1 iac,^N) , D,^i {N/2)} + 1 

Now, consider splitting a typec tree with splitting nodes S. Since ac,c = 0, we always ignore 
the (1 — etc, j)"Splitting-node m. Splitting at the 1/2-splitting-node c will result in multiple 
typej{j < i) trees of size at most [A^/2]. Splitting at the least common ancestor of c and all 
previous splitting nodes will result in multiple typei tree with size at most and multiple 
typej{j < i) trees with size at most [A^/2]. Namely, 

Dc,c{N) < max{Z)e,i(iV),I)c,c(iV/2)} + 1 

□ 

Lemma 8. For any c > 2, the c-splitting depth of a tree with N nodes by A is XdlogN — c) + 
c + 0(l). 

Proof. Let D'^-{N) be a function satisfying the following equations, 

= D;i((1 - a^,i)N) + 1 = D'^^^^^a^iN) + l,for 1 < i < c 

For each 1 < i < c, we have D'^-{N) = D^^((l — ac,j-i)-^/oc,j-i)- For each 1 < i < c, since 
^c,(^) = ^c,i((l - «c,)iV) + 1, D'^^^iN) = Z?;,+iK.iV) + 1 = D'^^,{a,,il - a,,+i)N) + 2, 
thus, 

D',^,{N) = D',^dac,i{l - ac,i+i)/{l - ac,i)N) + 1 

Since D'^ ^N) = D'^ -^{{1 - ac,i)N) + 1, we have (1 - ac,i)(l - ac,i)N = ac,j(l - Oc.i+i), and 
acA+i = 1 - (1 - ac,i)(l - Oic,i)/acA. Therefore, 

Qc,l(l - Qc.l)' 
~ 2ac,i - 1 + (1 - aciY 

On the other hand, D'^^^N) = D'^^dN) + l = D'^ ^^l- ac,c~i)N/ac,c-i), so we have Qc,c-i/(1- 
ac.c) = 1 - Oci and ac,c-i = (1 - ac,i)/(2 - ac,i)- Thus, (1 - ac,i)/(2 - Oci) = 1 - 

By setting Ac = iog(i-»,i) ' ^^^^ 1 < « < c, D'^ ^iN) > D'^^i{N/2) + 1. And then 
D'^^,{N) = max{I);i((l - a,,)N), D'^^,^,{a,,N), D'^^,{N/2)} + 1 

and 

D'{N) = max{D'{N),D'{N/2)} + 1 
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Combining with the previous lemma, it can be easily proved by induction that D'^-{N) upper 
bounds Dc^i{N) for all c, i. Specifically, 

DcAN) < D',^,{N) < A,(log - c) + D,,i{2') + 0(1) 



Since Dc,i(2'^) = c, the c-splitting depth of a tree with N nodes by the algorithm A is upper 

i_ 

bounded by Ac(log — c) + c + 0(1), where Ac satisfies J2i=i 2 = 1. □ 



Proof. (Proof of Theorem[T|) For every c > 2, we solve the above recurrences where N < 2'^. The 
running time is O* (^d^ogN{i-e)TW{<p)-^ when iV > 2^ the running time is O* (^SMi°^N-c)+c){i-e)TW(cf,)y 

The space is upper bounded by 0*(2'^'^''~'^(''^)) since only typei{i < c) trees are allowed. □ 

Furthermore, it can be proved that the space resource can be fully exploited to minimize 
the running time, which is of practical importance. More specifically, the following holds true. 

Corollary 2 (of Theorem [1]) . For any e' > there exists an algorithm which runs in space 
0*(2^"^W) and time 0*(d™(^) ^"§2 l-^l) time for a constant 6 <1. 

Lemma 9. Ac < 1 + 4r 

23 

Proof. Let f{X) = X'^ — Yl'i=0'^^-> then 7c is the root of f{X) = with largest absolute value. 
We know /(2) = 1 > 0, so if we can prove /(2 — ^) < then there must be a root between 2 

and 2 \r. Denote y = 2 \r, 

23 " 23 

/(y) <Q ^ y-<j2y' = ^ y < 2 - 1 

^ y-1 t 
The last inequality is true because y = 2 — > y/2 when c > 2 and 2 — ^ > 2 — = y. By 

^^ = i^'^^<i + A- ' ' □ 

Given the upper bound on Ac, the corollary can be proved as follows. 

Proof. (Proof of Corollary [2]) For fixed e and c, by Theorem (H there is an algorithm with 
running time o*((i-^=(^-^)i°S2^'^^(''^)) and space 0*(2^^'^('^)) for any e > 0. Set e = f , then 

the space is 0*(2^"^W) and the running time is 0*(d^=(^-^) ^■^('^)). By Lemma El 
Ac(l — ^) < (1 + ^)(1 ~ 7) < 1 when we pick a large enough c. □ 

Remark 3. Given a tree T, any algorithm A avoiding typcc+i trees with splitting depth 
SDc(^,r,0) requires O*((i(i-^)SDc(Ar,0)rw{0)) ^^^^ (9*(2«rwW) ^^^^g^ 

Optimality Similarly to the second half of the previous section, we also prove the optimality of 
the generalized tradeoff algorithm. We construct the hard instance using generalized fibonacci 
trees. 

Definition 8. For any integer c > 2, and a positive integer h, a (c, /i)-fibonacci tree(denoted 
C"'' Fc,h) is defined as by one of the rules, 

(1) if h < c, Fc.h is a chain of 2^^ nodes; 

(2) if h > c, Fc^h is constructed by starting from a chain of c nodes, then replacing the ith node 
by a subtree F^h-i- 
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An extended (c, /i, r)-fibonacci tree (denote as F*^^) is constructed by connecting one root node 
r to a subtree Fc^h- 

See Figure [9] for an illustration of a (c, /i)-fibonacci tree. A (c, /i)-fibonacci tree is indeed the 
hardest input of the splitting algorithm. To be more specific, the following lemma holds. 

Lemma 10. For each h>l, MSD(F*^^^, {r}) > h. 

Proof. For any c > 2, h > c and 1 < w < c, the tree Gc^h,w is defined as follows, first construct 
a chain of length w, connect c — w+1 splitting nodes to the first node of the chain, and connect 
a subtree Fc^h-c+w-i to i-th node of the chain. Denote as the set of the £ splitting nodes 
connected to the first node of the chain. We prove that MSD{Gc,h,w, Sc~w+i) > h — c + w, 
which implies the inequality that we need. Specifically, MSD c{F* ^ {r}) = MSDc{Gc^h,c, Si) > 
h — c + c > h. 

The inequality is proved by induction on h. The basic case is trivial. Suppose for any 
h < ho, MSDc{Gc,h,c,Sc-w+i) > h-c + w. Now we prove MSDc{Gc,ho,c, Si) > ho - c + 
w by induction on w. When w = 1, to prevent typei tree for i > c, we must split at the 
first node of the chain. Therefore, MSD c{G c,ho,i i Sc) = 1 + MSD c{Gc,ho-c,c: Si) > ho — c + 1. 
When w > 1, if the splitting node is in the subtree Fc^h^-c+w-i connected to the first node 
of the chain, MSDciGc,ho,w, Sc-w+i) > 1 + MSDc{Gc^ho,w-i, Sc-w+2) > ho - c + w, otherwise 
MSDc{Gc,ho,w, Sc-w+i) > 1 + MSDc{Gc,ho~c+w^i,c, Si) = ho - c + w. □ 




Theorem 4. For every c > 2 and N > 2^, there exists a tree T with N nodes, such that the 
c-minimal splitting depth of T MSDc(T, 0) is at least Ac(log N — c) + c — 0(1). 

Proof. Let |-Fc,/i| be the number of nodes in the tree -Fc,h- For any h < c, we have |-Fc,h| < 2'^, 
when h > c, we have |-Fc,fc| = Yli=i \ Fc,h-i\ + c. By the recursive relation, generating function 
of \Fc^h\ can be written as f{X) = X"" - J2i=o^^- Therefore \Fc^h\ = YTi=i^c,acj'' 1 where 6c,i 
is at most constant times of 2'^ and 7c, i is the i-th root of the equation f{X) = 0. 

Let 7c = argmaxi{|7c,i|}. When /i tends to infinity, \Fc,h\ = e(2'^7,^"^). So, h > log^^ (|Fc,/,|/2^)+ 
c — 0(1) = Ac(log |-Fc,h| — c) + c — 0(1). Therefore, for any c > 2 and N > 2*^, there exists 
a tree T with N nodes, such that the c-minimal splitting depth of T MSDc(T, 0) is at least 
Ac(logiV-c) + c-0(l). □ 
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Figure 9: A {c, h) -fibonacci tree (Fc^h)- 



Similarly to Theorem [3l we conclude the optimality of our tradeoff algorithm, namely, for 
fixed c > 2, e, < e < 1, any algorithm in 21^ runs in J7*(3Ac{iogAf-c)+c-0(i))_ 

4.6 An algorithm optimal in our framework 

As mentioned in Remark [3l for fixed c > 2, an optimal splitting algorithm, i.e. with smallest 
splitting depth implies an optimal algorithm in 2tc. Indeed, the following lemma assures that 
such an optimal splitting algorithm can be computed in quasi-polynomial time. 

Lemma 11. For a tree T with N nodes, MSDc(r, 0) can be computed in time O* (iV^^SDc(T,0)^ 
and polynomial space. 

Proof. For any h, whether MSDc(T, 0) < h in 0*{N'^) can be tested by a branch-and-bound 
algorithm. For any tree T and previous splitting nodes S, enumerate all nodes and check 
whether the minimal splitting depth of each subtree is at most h — 1 recursively. The maximal 
depth of the recursion is set to h, thus the running time is O* (^N^^ . Therefore, the overall time 
is at most O* (A^i^SDcCT,©)^ ^ The required space is polynomial. □ 

By applying the splitting algorithm in the previous lemma we obtain the following tradeoff 
algorithm. 

Theorem 5. For any c > 2, any e satisfying < e < 0, the hybrid algorithm can be done in 
time O*(d(i-^)^^SD.{T,0)rwW) ^p^^g o*(2-'^('^)). 

5 Some remarks on the complexity of bounded width SAT 

In this section we separate the complexity of the tree- width parameterized from the path- width 
parameterized SAT for the same width value, and we initiate the study of the incompressibility 
and non-sparsification of width-parameterized SAT instances. 

More preliminaries and notation For the first part we proceed by giving a machine characteriza- 
tion of SATtw(u^(?^)); the problem of deciding SAT where the CNF formula is given together 
with a tree-decomposition of width w{n), where n is the input length. |Pap09| gives a machine 
characterization of SATpw(u)(n)); the corresponding problem for a given path decomposition. 
We define NL [r(n)] to be the class of problems decidable by a log-space machine equipped 
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with a read-only non-deterministic, polynomially long tape, where the machine makes < r{n) 
passes (as the head reverses) over the witness tape. It was shown that SATpw(r(n) log n) is 
complete for NL[r(n)] under many-to-one logspace reductions. Also under the ETH NL[a;(l)] 
is incomparable to P. The ETH states that 3-SAT on A'^ variables cannot be decided in time 
2°(^). Further study shows that under the ETH together with the assumption that NP is not 
contain in some fixed polynomial space bound, as r(n) grows from constant to polynomial there 
is a strict hierarchy of classes that grows from NL all the way up to NP. 

We strengthen the assumption that NP ^ coNP to NL[a;(l)] ^ coNP. In fact, we assume 
further that 

NL[w(l)] 2 coNP/poly 

A complexity theoretic study of this assumption is interesting on its own right, and it is left 
for future work. Here are some indications on it validity: (i) the belief that NP ^ coNP is 
because usually people think that in fact the required certificate size blows up to exponential 
(not merely super-polynomial) - i.e. some kind of exhaustive enumeration is required - and (ii) 
given a non-uniform advice of polynomial size won't help either (in particular, an easy extension 
of Karp-Lipton shows that if NP C coNP/poly the polynomial hierarchy collapses). 

5.1 Tree-width parameterized SAT is harder than path-width parameterized SAT 

We characterize SATtw in terms of non-deterministic space bounded machines, that run in time 
polynomial, and in addition they are equipped with an unbounded stack |Coo71j . NAuxPDA(s(n), t(n)) 
is the class of decision problems decidable by such a machine in space 0{s{n)) and time 0{t[n)). 
SAC(s(n), t(n)) is defined as the semi-unbounded boolean circuit of size 0{s{n)) and depth 
0{t{n)), in which the AND gates have constant fan-in and all negations are at the input level. 
We have that SAC{logn,nOW) =: SAC\ and NC^ C SAC^ C AC^ 

The following theorem is a new characterization of SATtw The proof of this theorem is 
very different than the path-width characterization theorem of |Pap09| . 

Theorem 6. SATtw(log'n) is /lard /or NAuxPDA(log' n, n'^(^)), VZ G Z+, under O (log'' n)- 
space may-to-one reductions. 

Proof. By the characterization theorems from |Ven87j and |Ruz80j . it can be shown that 
NAuxPDA(log^ n,poly(n)) C SAC(2'^('°s' '"), log' n) by O(log'n) space bounded transforma- 
tions. Now it suffices to construct a SATtw(log'ra) instance (/), given the SAC(2^(i°s "),log'n) 
circuit C and an input x, such that <j) is satisfiable if and only if C evaluates to 1 on x. 

Our construction generalizes the observations in |GLS01j . Without loss of generality we 
assume that the circuit has the following normal form: 

(1) Fan-in of all AND gates is 2. 

(2) The circuit is layered. 

(3) The circuit is strictly alternating, odd-layer gates are OR, even-layer gates are AND. 

(4) The circuit has an odd number of layers. 

(5) NOT gates only appear in the bottom layer. 

A proof tree is a tree with the same layering as the circuit. Each node of the tree is labeled 
by a gate from the corresponding layer of the circuit. At an odd layer, each node has one child, 
while at an even layer, each node has two children. Two connected nodes must be labeled such 
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(a) A semi-unbounded circuit with a proof tree 
highlighted. NOT gates are hidden 



(b) The skeleton of the proof 
trees 



Figure 10: In llObI a SATtw(log^n) instance is constructed from the skeleton: each node corre- 
sponds to O(log'n) boolean variables; clauses are constructed for each dashed circle; and only 
those variables corresponding to a node shared by different dashed circles must be put into a 
bag in the tree decomposition, which ensures 0(log' n) tree width. 

that the corresponding gates are connected. At the bottom layer, each node must be labeled 
by an input gate or a NOT gate which outputs value 1. See Figure [TOal for an example. 

A proof tree can be viewed as the witness that a circuit evaluates to 1 on the given input. 
One may observe that, since we assume that all the circuits are of normal form, every proof tree 
must have the same shape. The tree of the same shape of a proof tree without labeling is called 
a skeleton(see Figure [lObp . Showing that the circuit evaluates to 1 on an input is equivalent to 
giving a labeling satisfying the proof tree conditions to the skeleton. 

One of the 2*-^^^°^ '^^ gates in the circuit can be indexed by a 0(log' n) bit binary string. For 
each node v in the skeleton, assign a variable Xy using space 0(log' n), to indicate the index of 
the gate that this node be labeled. This variable is also seen as a group of O(log'n) boolean 
variables. For each pair of connected nodes u, v in the skeleton, and for each pair of possible 
indices a, b, which can be assigned to and x^, correspondingly satisfying the conditions of a 
proof tree, create a clause encoding Xu = a A x^ = b. All these clauses form an SAT instance 
whose incidence graph has a tree decomposition with tree width 0(log' n) (See Figure [TOl for an 
example and illustration). □ 

Corollary 3. SATtw(logn) is hard for SAC"*^, under log-space many-to-one reductions. 

This corollary follows by the characterization in [Ven87] . 

Therefore, under the standard assumption that NL C SAC^ we separate the complexity 
of SATpw(logn) and SATtw(logn). How about higher width values? In this case we do not 
have well-established higher analogs of NL C SAC^. However, the characterization of Theorem 
[6] together with the main theorem in |Pap09| can be understood in the other direction. This 
means that the conjectures about the corresponding complexity classes may be true exactly 
because one may conjecture that SATt:^^{w{n)) is harder than SATp^{w{n)). It seems that 
there is much more to be done towards this direction. 

5.2 Incompressibility and non-sparsification 

By compressibility of SAT instances we mean that the input instance or parameters can be 
reduced by an efficient algorithm, in a way that the compressed instance preserves the satisfia- 
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bility. Let us start with some prehminary observations stating that no non-trivial compression 
can be done to reduce the width parameter. 

Suppose we have an instance (p together with an optimal tree decomposition of width 
TW((/>) = w(log \(j)\), and assume that there is a procedure running in polynomial time, which 
constructs a new instance (p' with tree-width TW{cl)') = ^TW((/)) such that (/)'. If 

we repeat this procedure for logTW{<j)) times, we will obtain an instance (j), where 'TW{4>) is 
constant and has the same satisfiability as (j). Satisfiability of (p can be determined in polynomial 
time, and the transformation can also be done in polynomial time, therefore we can determine 
satisfiability of (p in polynomial time. Under ETH, this is not possible, which implies that 
determining satisfiability of (p requires 2^^^^^'^)^ time. Namely, such a procedure cannot exist. 

Next we turn to the question of interactively "compressing" the instance length. The litera- 
ture refers to this process as sparsification. [DvMlOj shows that in order for a polynomial-time 
bounded machine to decide 3-SAT with the help of an unbounded oracle, if the number of bits 
needed to be sent to the oracle is ©(re^"*^), where n is the number of variables and e > 0, then 
NP C coNP/poly. Let L be a language, denote OR(L) be the problem of asking whether one 
of the input instance belongs to L. The following lemma is crucial for the proof. 

Lemma 12 ( [DvMlO] ). Let L be a language, with instance size s and t : Z"*" — t- Z"*" be poly- 
nomially hounded s.t. the problem of OR(L) with t{s) instances can be decided by sending 
0{t{s) log t{s)) bits, then L e coNP/po/y. 

Applying the same technique, we obtain the non-sparsification of 3-SAT pw(tL'(n)) instances, 
w{n) = f^(log n). 

Lemma 13. // a 5-SATpw(u;(n) log n) instance can be decided by sending 0{n^~'') bits to the 
oracle, then NL[tM(n)/ log n] C coNP/po/y. 

Proof. Obviously, s is at most polynomial in n. Consider an OR(3-SATpw(tL'(n))) instance, 
which contains t{s) 3-SATp^{w{n)) instances each with n variables can be represented by a 
3-SATpw(tL'(n)) instance with t{s)s variables. Assume all the instances use different variables. 
Suppose the instances are (pi, Mi, and each has a corresponding path-decomposition Vi, variables 
Vij, clauses Cjj. Simply joining path-decompositions will impose an AND-relation instead of 
an OR-relation. To impose an OR-relation, first join the path-decompositions sequentially, let 
a be a selector, for each Vi, replace each clause Cjj by a clause representing (a = i) — )■ Cij = 
{a = i)\/Cij. a can be implemented by 0(logt(,s)) = O(logn) variables appearing in every bag. 
One last step is that each newly created clause is of O(logri) variables, to break each of them 
into clauses of 3 variables by standard technique, 0{w{n) logn) variables need to be introduced. 
In the end, we constructed a 3-SAT instance with path-width w{n)logn. Now by hypothesis, 
this instance of t{s)s variables can be decided by sending (i(s)s)^~'^ bits, by Lemma [T2\ this 
means 3-SATpw(w(n)) is in coNP/poly, and by the characterization mentioned before, the 
lemma follows. □ 

6 Conclusions 

We devised a simple algorithm for deciding the satisfiability of arbitrary CNF formulas, that 
runs in time 2'^^"^^^^^°^^^^^ and space polynomial. We conjecture that doing asymptotically 
better in the exponent blows up the space to exponential in the tree-width. Our main technical 
development is a family of deterministic algorithms that achieve tradeoffs by trading constants 
in the exponent between space and running time. 

One issue we did not discuss is whether randomness can be helpful towards better algorithms 
for tree-width bounded SAT. Take for example Schoening's algorithm |Sch99j for 3-SAT, which 
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achieves running time a", for a constant a < 2 and space polynomial. Intuitively, this algorithm 
is oblivious to any structure the given CNF may have. An interesting question is finding a way 
to exploit the small width structure using randomness. 

This paper also initiates an in-depth study of the computational complexity of width- 
parameterized SAT. One possible direction is understanding the implications of our conjecture; 
i.e. developing machinery that leads to either proving or disproving it. Another interesting re- 
search direction is to obtain stronger sparsification results than those obtained in Section [5j The 
main technical obstacle towards this goal is that the packing lemma of [DvMlOj does not apply 
in the setting of bounded-width SAT. The reason is that even the first step in the construction 
of |DvM10j blows up the tree-width from any value (e.g. log'^ n) to linear. That is, following 
their work the given formula is right away transformed into a formula of huge tree-width. We 
believe that proving non-sparcification 0{n) is possible, and it seems to require the development 
of new non-sparsification tools. 

In general, understanding various aspects of the computational complexity of width-parameterized 
SAT is an issue left open for future research. 
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